Learning on real robots from experience and simple user feedback
Similar resources
Learning from User Feedback for Machine Translation in Real-Time
Post-editing is the most popular approach to improve accuracy and speed of human translators by applying the machine translation (MT) technology. During the translation process, human translators generate the translation by correcting MT outputs in the post-editing scenario. To avoid repeating the same MT errors, in this paper, we propose an efficient framework to update MT in real-time by lear...
RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
The principal aim of a search engine is to provide results sorted according to the user’s requirements. To achieve this aim, it employs ranking methods to rank web documents based on their significance and relevance to the user query. The novelty of this paper is a user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
Web pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user's query. Despite the huge number of the extracted results for user's query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
Statistics and Machine Learning Techniques for Real End-User Experience
Real End-User Experience (RUE) is a monitoring approach that aims to measure the end-user experience by providing information on availability, response time, and reliability of the real used IT services. The response time of each user transaction is measured by an analysis of the network communication flows. Several performance metrics get archived to monitor RUE over time. An abstract, general...
Learning from Experience with Delayed Feedback
Many important settings in individual and organizational life involve allocating resources between different types of activities with different delays between allocation and results. Examples include factory managers choosing to spend time on production now or on process improvement that may boost output later, and individuals choosing to get a job now or stay in school to get a better job late...
Journal
Journal title: Journal of Physical Agents (JoPha)
Year: 2013
ISSN: 1888-0258
DOI: 10.14198/jopha.2013.7.1.08